KWSim: Concepts Similarity Measure
نویسندگان
چکیده
The comparison of manually annotated medical images can be done using the comparison of keywords in a lexical way or using the existing medical thesauri to calculate semantic similarity. In this paper, first we introduce the KWSim measure, a fully automated technique of measuring semantic similarity by mapping concepts(keywords) to different medical thesauri and examining the “is-a” relation type. A keyword vector similarity is also presented, based on the KWSim measure. Our approach is implemented using MeSH, ICD-10 and SNOMED CT thesauri and compared with two other existing approaches. We illustrate our method with a real time online annotation assistant. RÉSUMÉ. La comparaison des images médicales annotées manuellement peut être réalisée grâce à une comparaison lexicale entre des mots-clés ou en utilisant des thésaurus médicaux existants pour calculer une similarité sémantique entre ces mots. Dans cet article, nous présentons tout d’abord la mesure KWSim, une technique entièrement automatisée pour le calcul de la similarité sémantique en mappant des concepts (mots-clés) aux différents thésaurus médicaux et en examinant le type de relation « is-a ». Une similarité entre les vecteurs de mots-clés est également présentée, basée sur la mesure KWSim. Notre approche est implémentée en utilisant MeSH, ICD-10 et SNOMED CT thésaurus et comparée avec deux autres approches existantes. Nous illustrons notre méthode avec un assistant d’annotation en ligne et en temps réel.
منابع مشابه
خوشهبندی اسناد مبتنی بر آنتولوژی و رویکرد فازی
Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...
متن کاملAn improved similarity measure of generalized trapezoidal fuzzy numbers and its application in multi-attribute group decision making
Generalized trapezoidal fuzzy numbers (GTFNs) have been widely applied in uncertain decision-making problems. The similarity between GTFNs plays an important part in solving such problems, while there are some limitations in existing similarity measure methods. Thus, based on the cosine similarity, a novel similarity measure of GTFNs is developed which is combined with the concepts of geometric...
متن کاملA Novel Image Structural Similarity Index Considering Image Content Detectability Using Maximally Stable Extremal Region Descriptor
The image content detectability and image structure preservation are closely related concepts with undeniable role in image quality assessment. However, the most attention of image quality studies has been paid to image structure evaluation, few of them focused on image content detectability. Examining the image structure was firstly introduced and assessed in Structural SIMilarity (SSIM) measu...
متن کاملA New Ontology-Based Semantic Similarity Measure for Concepts Subsumed by Multiple Super Concepts
Semantic Similarity relates to computing the similarity between concepts of ontology. There exist four approaches to calculate the semantic similarity. The first approach is based on path length. Under this approach we studied and compared some of the measures on a bench mark dataset. Among the compared measures Wu & Palmer measure has the advantage of being simple to implement and has better p...
متن کاملInformation Content Based Semantic Similarity Measure for Concepts Subsumed By Multiple Concepts
85 ABSTRACT: Semantic Similarity relates to computing the similarity between different ontological concepts .Various categories of semantic similarity measures have been proposed to determine how similar are they between any two concepts within ontology. Information Content (IC) measure is one such among the category of measures. We analyze different methods of calculating the IC value for any ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008